Discretization of variational regularization in 
Banach spaces 

Christiane Pöschl*, Elena Resmerita†, and Otmar Scherzer‡

April 19, 2010 

Abstract 

Consider a nonlinear ill-posed operator equation F(u) = y where F is 
defined on a Banach space X. In general, for solving this equation numer- 
ically, a finite dimensional approximation of X and an approximation of 
F are required. Moreover, in general the given data $y^\delta$ of $y$ are noisy. In
this paper we analyze finite dimensional variational regularization, which 
takes into account operator approximations and noisy data: We show 
(semi-)convergence of the regularized solution of the finite dimensional 
problems and establish convergence rates in terms of Bregman distances 
under appropriate sourcewise representation of a solution of the equation. 
The more involved case of regularization in nonseparable Banach spaces 
is discussed in detail. In particular we consider the space of finite total 
variation functions, the space of functions of finite bounded deformation, 
and the $L^\infty$-space.

Key words: Ill-posed problem, Regularization, Bregman Distance, 
Strict Convergence 

1 Introduction 



Let $F : X \to Y$ be a nonlinear operator with domain $\mathcal{D}(F)$, where $X$ is a
Banach space and $Y$ is a Hilbert space. We would like to approximate solutions
of the ill-posed equation 

F(u)=y (1) 

via variational regularization. 



* Dept. of Information & Communication Technologies, Universitat Pompeu Fabra, C/ Tanger 122-140, 08018 Barcelona, Spain

† Department of Industrial Mathematics, Johannes Kepler University, Altenbergerstraße 69, A-4040 Linz, Austria, elena.resmerita@jku.at

‡ Computational Science Center, University of Vienna, Nordbergstraße 15, A-1090 Vienna, Austria, and Radon Institute of Computational and Applied Mathematics, Altenbergerstraße 69, A-4040 Linz, Austria, otmar.scherzer@univie.ac.at






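Let $\mathcal{R} : X \to [0, +\infty]$ be a penalty functional with nonempty domain $\mathcal{D}(\mathcal{R})$. An element $\bar u \in \mathcal{D}(F) \cap \mathcal{D}(\mathcal{R})$ is called an $\mathcal{R}$-minimizing solution of (1) if it solves the constrained optimization problem

$$\min \mathcal{R}(u) \quad \text{subject to} \quad F(u) = y. \tag{2}$$

We assume that only noisy data $y^\delta$ of $y$ are available, which satisfy

$$\|y - y^\delta\| \le \delta. \tag{3}$$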



In order to solve equation (1) numerically, the space $X$ has to be approximated by a sequence of finite dimensional subspaces $X_n$. The situation when the spaces $X$ and $Y$ are Hilbert and the regularization is quadratic has been analyzed in [23], [26] (linear problems) and [25], [27] (nonlinear problems). However, recent advances in regularization theory deal with general Banach spaces. Although convergence of regularization methods in the general setting has been established (see, e.g., [29], [18]), a unifying discretization approach is still not at hand. In comparison with the Hilbert space theory, a significant complication is due to the fact that a nonseparable Banach space cannot be approximated by a nested sequence of finite dimensional subspaces with respect to the norm topology. We have in mind the case of the space of bounded variation functions BV or of bounded deformation functions BD, which are not separable. However, as mentioned in [3, page 121] in the context of BV, "the norm topology is too strong for many applications", and in particular also when considering finite dimensional approximations.

In this work, we show how ill-posed nonlinear equations can be approximated 
by solving associated finite dimensional convex regularization problems. The 
case of nonseparable Banach spaces is particularly emphasized. We propose 
to approximate such spaces X by subspaces with respect to a topology which 
is weaker than the norm topology. Instead of the norm topology on X (as in 
the separable Hilbert space or in the separable Banach space setting) we use 
a metric on X, which requires to have available an adequate superspace of X. 
Our investigation covers a large class of not necessarily separable Banach spaces 
which are frequently used for regularizing imaging and other inverse problems. 

The paper is organized as follows. Section 2 specifies the assumptions and shows well-posedness and convergence of the discretized regularization method in the case that the Banach space $X$ is nonseparable and the regularization term is a not necessarily convex function. Also, convergence rates with respect to Bregman distances are obtained in the convex regularization setting, under a standard source condition. The finite dimensional approximation of solutions of the equation in separable Banach spaces $X$ is briefly discussed in Section 3, following the analysis done in Section 2. Section 4 studies in some detail the discretization of several relevant nonseparable Banach spaces such as the space of bounded variation functions, the space of bounded deformation functions and the space of essentially bounded functions. The inverse ground water filtration problem is analyzed in the natural setting of $L^\infty(\Omega)$ in Section 5.

2 The case of nonseparable Banach spaces



2.1 Main assumptions 

Let $X$ be a not necessarily separable Banach space which can be embedded into a separable Banach space $Z$. Let $\tau$ be a topology on $X$ which is weaker than the norm topology on $X$. Therefore, we refer to $\tau$ as the weak topology. In addition, let $\mathcal{R} : X \to [0, +\infty]$ be a proper functional.

We define a metric on the space $X$ by the norm of $Z$ induced on $X$ and a pseudometric generated by the function $\mathcal{R}$:

$$d(u,v) = \|u - v\|_Z + |\mathcal{R}(u) - \mathcal{R}(v)|. \tag{4}$$

We shall denote by $\tau_d$ the topology generated by this metric. Relating to the above discussion, this is an intermediary topology between the norm topology on $X$ and the weak topology $\tau$.
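The effect of the second term in (4) can be seen in a small numerical sketch. It is only an illustration under assumptions that are not part of the text: functions on (0,1) are represented by their values on a uniform grid, $Z = L^1(0,1)$, and $\mathcal{R}$ is the total variation of the resulting piecewise constant function. The helper names are hypothetical.

```python
def l1_norm(w, h):
    """Riemann-sum value of the L^1 norm of a piecewise constant function (cell width h)."""
    return h * sum(abs(x) for x in w)

def total_variation(w):
    """Total variation of a piecewise constant function: the sum of its jumps."""
    return sum(abs(b - a) for a, b in zip(w, w[1:]))

def metric_d(u, v, h):
    """d(u, v) = ||u - v||_Z + |R(u) - R(v)| with Z = L^1 and R = total variation."""
    diff = [a - b for a, b in zip(u, v)]
    return l1_norm(diff, h) + abs(total_variation(u) - total_variation(v))

def oscillation(n):
    """Function alternating between 0 and 1/n on 2n equal cells of (0, 1)."""
    return [(i % 2) / n for i in range(2 * n)]

if __name__ == "__main__":
    for n in (4, 16, 64):
        v = oscillation(n)
        zero = [0.0] * len(v)
        h = 1.0 / len(v)
        # The L^1 distance to zero shrinks, the distance in d does not.
        print(n, l1_norm(v, h), metric_d(v, zero, h))
```

In this example the oscillating functions tend to zero in $L^1$, while their total variation stays close to 2, so they do not converge to zero with respect to $d$. The metric therefore separates sequences that the norm of $Z$ alone cannot distinguish.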

The following elementary result gives us a motivation for approximating a 
nonseparable Banach space in a topology that is weaker than the norm topology. 

Proposition 2.1. A Banach space $X$ is separable if and only if there exists a nested (increasing) sequence of finite dimensional subspaces $\{X_n\}$ such that

$$X = \overline{\bigcup_{n \in \mathbb{N}} X_n},$$

where the closure is considered with respect to the norm topology of $X$.

Consider a sequence of nested, finite dimensional subspaces $\{X_n\}$ which satisfies

$$\overline{\bigcup_{n \in \mathbb{N}} X_n}^{\,d} = X. \tag{5}$$

That is, $X$ is the closure of the union of the subspaces $X_n$ with respect to the topology of the metric $d$. This property holds for many nonseparable Banach spaces; see several examples in Section 4.

We are given approximation operators $F_m$ of $F$, which have the same domain $\mathcal{D}(F)$ as $F$. We assume that the operators $F$, $F_m$ satisfy the following approximation property:

$$\|F(u) - F_m(u)\| \le \rho_m \quad \text{for all } u \in \mathcal{D}(F) \cap \mathcal{D}(\mathcal{R}). \tag{6}$$

Here, the constant $\rho_m$ should only depend on $m$ and satisfy

$$\lim_{m \to \infty} \rho_m = 0.$$

Denote

$$D_n := X_n \cap \mathcal{D}(F) \cap \mathcal{D}(\mathcal{R}), \quad n \in \mathbb{N},$$

and assume that the sets $D_n$ are nonempty. We are interested in approximating $\mathcal{R}$-minimizing solutions of equation (1) by solutions $u^{\alpha,\delta}_{m,n} \in D_n$ of the problem

$$\min \; \|F_m(u) - y^\delta\|^2 + \alpha \mathcal{R}(u) \quad \text{subject to } u \in D_n. \tag{7}$$
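To make the roles of $m$, $n$, $\alpha$ and $\delta$ in (7) concrete, the following sketch solves a toy instance in closed form. Everything specific here is an assumption made for illustration and does not come from the text: $F_m$ is taken to be a linear diagonal operator with nonzero entries `s_m`, the penalty is the quadratic $\mathcal{R}(u) = \sum_i u_i^2$, and $X_n$ consists of the vectors supported on the first $n$ coordinates.

```python
def discretized_tikhonov(s_m, y_delta, alpha, n):
    """Minimize sum_i (s_m[i]*u_i - y_delta[i])**2 + alpha * sum_i u_i**2
    over vectors u whose entries with index >= n vanish (the subspace X_n).

    The problem decouples per coordinate; setting the derivative
    2*s*(s*u - y) + 2*alpha*u to zero gives u = s*y / (s**2 + alpha).
    """
    u = [0.0] * len(s_m)
    for i in range(min(n, len(s_m))):
        u[i] = s_m[i] * y_delta[i] / (s_m[i] ** 2 + alpha)
    return u

if __name__ == "__main__":
    s = [1.0 / (i + 1) for i in range(8)]        # decaying entries: a mildly ill-posed toy operator
    u_true = [1.0] * 8
    y = [si * ui for si, ui in zip(s, u_true)]   # exact data
    noise = [0.01 * (-1) ** i for i in range(8)] # deterministic perturbation, |noise_i| = 0.01
    y_delta = [yi + ei for yi, ei in zip(y, noise)]
    for n, alpha in ((2, 1e-1), (4, 1e-2), (8, 1e-3)):
        print(n, alpha, discretized_tikhonov(s, y_delta, alpha, n))
```

The sketch only shows how the discretization level $n$ and the parameter $\alpha$ enter the minimizer; it does not reproduce the general (nonlinear, nonquadratic) setting analyzed here.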

In order to pursue the analysis, we make several (standard) assumptions on the spaces $X$, $Y$, the operator $F$, the functional $\mathcal{R}$ (see also [29], [18], [30]), and on the approximations $X_n$, $F_m$ as well:

Assumption 2.2.

1. The Banach space $X$ is provided with a topology $\tau$ such that

• The topology $\tau_d$ is finer than the topology $\tau$.

• The norm topology is finer than the topology $\tau_d$.

2. The domain $\mathcal{D}(F)$ is $\tau$-closed.

3. The operator $F : \mathcal{D}(F) \subset X \to Y$ is sequentially $\tau$-weakly closed. That is, $\{u_k\} \subset \mathcal{D}(F)$, $u_k \to u$ and $F(u_k) \to v$ imply $u \in \mathcal{D}(F)$ and $v = F(u)$. Moreover, the operator $F$ is continuous from $\mathcal{D}(F) \subset Z$ with the norm topology to $(Y, \|\cdot\|)$.

4. For every $m \in \mathbb{N}$, the operator $F_m$ is sequentially $\tau$-weakly closed.

5. The function $\mathcal{R}$ is sequentially $\tau$-lower semicontinuous.

6. For every $M > 0$, $\alpha > 0$ and every $m, n \in \mathbb{N}$, the sets

$$\{u \in X_n : \|F(u)\|^2 + \alpha \mathcal{R}(u) \le M\} \tag{8}$$

are $\tau$-sequentially relatively compact.

7. For every $u \in X$, there exists some $u_n \in X_n$ such that $d(u_n, u) \to 0$ as $n \to \infty$.

2.2 Well-posedness of the discretized problem 

We emphasize again one of the main ideas in this work: when discretizing a nonseparable Banach space, one can work with topologies which are weaker than the original norm topology and which might be more natural than the latter. A well-posedness result for problem (7) can now be stated in this theoretical setting.

Proposition 2.3. Let $m, n \in \mathbb{N}$ and $\alpha, \delta > 0$ be fixed. Moreover, let Assumption 2.2 and (3), (6) be satisfied.

Then, for every $y^\delta \in Y$, there exists at least one minimizer of (7).

Moreover, the minimizers of (7) are stable with respect to the data $y^\delta$ in the following sense: if $\{y_k\}_{k \in \mathbb{N}}$ converges strongly to $y^\delta$, then every sequence $\{u_k\}_{k \in \mathbb{N}}$ of minimizers of (7), where $y^\delta$ is replaced by $y_k$, has a subsequence $\{u_l\}_{l \in \mathbb{N}}$ which converges with respect to the topology $\tau$ to a minimizer $\tilde u$ of (7) and such that $\{\mathcal{R}(u_l)\}_{l \in \mathbb{N}}$ converges to $\mathcal{R}(\tilde u)$ as $l \to \infty$.

The proof is analogous to the one for Theorem 3.23 in [30], which in turn generalizes a proof in [14] from Hilbert to Banach spaces.

In the sequel, we prove (semi)convergence of the finite dimensional regular- 
ization method. 

Theorem 2.4. Let Assumption 2.2 be satisfied. Moreover, assume that:

(i) Equation (1) has an $\mathcal{R}$-minimizing solution $\bar u$ in the interior of $\mathcal{D}(\mathcal{R}) \cap \mathcal{D}(F)$, considered in the norm topology;

(ii) $v_n \in \mathcal{D}(F)$ for $n$ sufficiently large, where $v_n \in X_n$ and $d(v_n, \bar u) \to 0$ as $n \to \infty$;

(iii) The parameter $\alpha = \alpha(m,n,\delta)$ is such that $\alpha \to 0$, $\delta^2/\alpha \to 0$, $\rho_m^2/\alpha \to 0$ and

$$\frac{\|F(v_n) - y\|^2}{\alpha} \to 0 \tag{9}$$

as $\delta \to 0$ and $m, n \to \infty$.

If (3), (6) also hold, then every sequence $\{u_k\}$, with $u_k := u^{\alpha_k,\delta_k}_{m_k,n_k}$ and $\alpha_k := \alpha(m_k, n_k, \delta_k)$, where $\delta_k \to 0$, $n_k \to \infty$, $m_k \to \infty$ as $k \to \infty$ and $u_k$ is a solution of (7), has a subsequence $\{u_l\}$ which converges with respect to the topology $\tau$ to an $\mathcal{R}$-minimizing solution $\tilde u$ of equation (1) and such that $\{\mathcal{R}(u_l)\}_{l \in \mathbb{N}}$ converges to $\mathcal{R}(\tilde u)$ as $l \to \infty$. Moreover, if $\bar u$ is the unique solution of (1), then the entire sequence $\{u_k\}$ converges to $\bar u$ in the sense of $\tau$ and $\mathcal{R}$ as above.



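Proof. From (4) and Assumption 2.2, Item 7, it follows that $\mathcal{R}(v_n) \to \mathcal{R}(\bar u)$ for $n \to \infty$. From the definition of $u^{\alpha,\delta}_{m,n}$, the estimate (3) and (6), it follows that

$$\begin{aligned}
\|F_m(u^{\alpha,\delta}_{m,n}) - y^\delta\|^2 + \alpha \mathcal{R}(u^{\alpha,\delta}_{m,n})
&\le \|F_m(v_n) - y^\delta\|^2 + \alpha \mathcal{R}(v_n) \\
&\le \big(\|F_m(v_n) - F(v_n)\| + \|F(v_n) - F(\bar u)\| + \|F(\bar u) - y^\delta\|\big)^2 + \alpha \mathcal{R}(v_n) \\
&\le \big(\rho_m + \|F(v_n) - y\| + \delta\big)^2 + \alpha \mathcal{R}(v_n).
\end{aligned} \tag{10}$$

Therefore,

$$\mathcal{R}(u^{\alpha,\delta}_{m,n}) \le \frac{\big(\rho_m + \|F(v_n) - y\| + \delta\big)^2}{\alpha} + \mathcal{R}(v_n).$$

Assumption (iii) now guarantees that

$$\limsup \mathcal{R}(u^{\alpha,\delta}_{m,n}) \le \mathcal{R}(\bar u). \tag{11}$$

Observe that $\lim_{n \to \infty} \|F(v_n) - F(\bar u)\| = 0$ since $F$ is continuous at $\bar u$ (compare Assumption 2.2, Item 3, and (i), (ii)). By (iii), the quantity $(\rho_m + \|F(v_n) - F(\bar u)\| + \delta)^2$ in (10) converges to zero as $\delta \to 0$ and $m, n \to \infty$, and then

$$\lim \|F_m(u^{\alpha,\delta}_{m,n}) - y^\delta\| = 0.$$

Therefore, by applying again (6) and (3), we also have that

$$F(u^{\alpha,\delta}_{m,n}) \to y \tag{12}$$

with respect to the norm of $Y$, as $\delta, \alpha \to 0$ and $m, n \to \infty$. Consider $\alpha_k := \alpha(m_k, n_k, \delta_k)$ and $u_k := u^{\alpha_k,\delta_k}_{m_k,n_k}$. Note that $\{\|F(u_k)\|^2 + \alpha_k \mathcal{R}(u_k)\}$ is bounded. Hence, by (8) and by the compactness hypothesis in Assumption 2.2, there exists a subsequence $\{u_j\}$ which is $\tau$-convergent to some $\tilde u \in \mathcal{D}(F)$ (recall that $\mathcal{D}(F)$ is $\tau$-closed). Due to the lower semicontinuity of $\mathcal{R}$ and (11), we get

$$\mathcal{R}(\tilde u) \le \liminf \mathcal{R}(u_j) \le \limsup \mathcal{R}(u_j) \le \mathcal{R}(\bar u).$$

We also have that $F(\tilde u) = y$, since $F(u_j) \to y$ as $j \to \infty$ due to (12) and Item 4 in Assumption 2.2. Therefore, $\tilde u$ is an $\mathcal{R}$-minimizing solution of equation (1) and

$$\mathcal{R}(\tilde u) = \lim \mathcal{R}(u_j).$$

If the solution $\bar u$ is unique, the only limit point of $\{u_k\}$ with respect to $\tau$ and $\mathcal{R}$ is $\bar u$. □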

Remark 2.5. In some situations, convergence of a sequence $\{u_k\}$ to $u$ with respect to the topology $\tau$ and such that $\mathcal{R}(u_k) \to \mathcal{R}(u)$ implies $d(u_k, u) \to 0$ as $k \to \infty$. This happens for BV and BD, which are embedded into $Z = L^1$, when $\tau$ is chosen as the weak* topology and $\mathcal{R}$ is the total variation and the total deformation seminorm, respectively; see Section 4.



2.3 Convergence Rates 

In this section, we establish error estimates, with respect to Bregman distances, for the approximation method under consideration. To this end, assume throughout this section that the regularization functional $\mathcal{R}$ is convex.

Recall that the Bregman distance with respect to a possibly non-smooth convex functional $\mathcal{R}$ is defined by

$$D_{\mathcal{R}}(v,u) = \{ D^{\xi}_{\mathcal{R}}(v,u) : \xi \in \partial\mathcal{R}(u) \neq \emptyset \}, \quad u, v \in \mathcal{D}(\mathcal{R}),$$

where

$$D^{\xi}_{\mathcal{R}}(v,u) = \mathcal{R}(v) - \mathcal{R}(u) - \langle \xi, v - u \rangle.$$
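As a quick finite dimensional illustration of this definition, the following sketch evaluates $D^{\xi}_{\mathcal{R}}(v,u)$ for vectors. The two penalties and the chosen subgradients are illustrative assumptions, not examples taken from the text: for $\mathcal{R}(u) = \sum_i u_i^2$ the only subgradient is $2u$, and for $\mathcal{R}(u) = \sum_i |u_i|$ a subgradient is given by the signs of $u$, with an arbitrary value in $[-1,1]$ where $u_i = 0$.

```python
def bregman_distance(R, xi, v, u):
    """D^xi_R(v, u) = R(v) - R(u) - <xi, v - u>, where xi is a subgradient of R at u."""
    pairing = sum(x * (a - b) for x, a, b in zip(xi, v, u))
    return R(v) - R(u) - pairing

def squared_norm(w):
    return sum(x * x for x in w)

def l1_penalty(w):
    return sum(abs(x) for x in w)

if __name__ == "__main__":
    u = [1.0, -2.0, 0.0]
    v = [3.0, -1.0, 0.5]

    # Quadratic penalty: the Bregman distance is the squared Euclidean distance.
    xi_sq = [2.0 * x for x in u]
    print(bregman_distance(squared_norm, xi_sq, v, u))

    # l^1 penalty with the subgradient (1, -1, 0): coordinates where v keeps
    # the sign of u do not contribute to the distance.
    xi_l1 = [1.0, -1.0, 0.0]
    print(bregman_distance(l1_penalty, xi_l1, v, u))
```

The second case shows that a Bregman distance can be much weaker than a norm: it may ignore differences between $v$ and $u$ in directions where the penalty is linear.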

More information about Bregman distances and their role in optimization and inverse problems can be found in [28]. Error estimates for variational or iterative regularization of (1) by means of a non-quadratic penalty have been shown in [51 [2B1 ESI 13 ES]. The Bregman distance $D_{\mathcal{R}}$ associated with $\mathcal{R}$ was naturally chosen as the measure of discrepancy in these error estimates.

We assume Fréchet differentiability of the operator $F$ around $\bar u$, which is considered to be in the interior of $\mathcal{D}(F) \cap \mathcal{D}(\mathcal{R})$; moreover, assume that its extension to the space $Z$ is also Fréchet differentiable around $\bar u$. In fact, our study is based on the following source-wise representation:

There exists $\omega \in Y$ such that

$$\xi = F'(\bar u)^* \omega \in \partial\mathcal{R}(\bar u), \tag{13}$$
and on the following nonlinearity condition (see also [29]):

There exist $\varepsilon > 0$ and $c > 0$ such that

$$\|F(u) - F(\bar u) - F'(\bar u)(u - \bar u)\| \le c\, D^{\xi}_{\mathcal{R}}(u, \bar u) \tag{14}$$

is satisfied for all $u \in \mathcal{D}(F) \cap U_\varepsilon(\bar u)$ with respect to the above subgradient $\xi$ and such that

$$c \|\omega\| < 1. \tag{15}$$

Here

$$D^{\xi}_{\mathcal{R}}(u, \bar u) = \mathcal{R}(u) - \mathcal{R}(\bar u) - \langle \xi, u - \bar u \rangle,$$

where $\xi \in \partial\mathcal{R}(\bar u)$ satisfies (13). Let us denote by

$$\gamma_n := \|F'(\bar u)(v_n - \bar u)\|, \tag{16}$$

$$\lambda_n := D^{\xi}_{\mathcal{R}}(v_n, \bar u). \tag{17}$$

Here $\{v_n\}$ is a sequence as in Theorem 2.4.

In the following we derive a relation between the Bregman distance and the metric $d$ at $\bar u$. Observe that

$$D^{\xi}_{\mathcal{R}}(v_n, \bar u) = \mathcal{R}(v_n) - \mathcal{R}(\bar u) - \langle \omega, F'(\bar u)(v_n - \bar u) \rangle.$$

One has $\mathcal{R}(v_n) - \mathcal{R}(\bar u) \to 0$ since $d(v_n, \bar u) \to 0$ as $n \to \infty$ (see (4)). Moreover, $F'(\bar u)(v_n - \bar u) \to 0$. As a consequence, $\lim_{n \to \infty} \lambda_n = 0$. Thus, convergence with respect to the metric $d$ is stronger than convergence with respect to the related Bregman distance.

Theorem 2.6. Suppose that Assumption 2.2, the assumptions in the previous result, and conditions (13), (14) and (15) hold. Moreover, assume that $\rho_m = O(\delta + \lambda_n + \gamma_n)$, with $\lambda_n$, $\gamma_n$ given by (17), (16). If $\alpha \sim \max\{\delta, \lambda_n, \gamma_n\}$, then

$$D^{\xi}_{\mathcal{R}}(u^{\alpha,\delta}_{m,n}, \bar u) = O(\delta + \lambda_n + \gamma_n). \tag{18}$$


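Proof. We have

$$\begin{aligned}
\|F_m(u^{\alpha,\delta}_{m,n}) - y^\delta\|^2 + \alpha \mathcal{R}(u^{\alpha,\delta}_{m,n})
&\le \|F_m(v_n) - y^\delta\|^2 + \alpha \mathcal{R}(v_n) \\
&\le \big(\|F_m(v_n) - F(v_n)\| + \|F(v_n) - F(\bar u)\| + \|F(\bar u) - y^\delta\|\big)^2 + \alpha \mathcal{R}(v_n) \\
&\le \big(\rho_m + \|F(v_n) - F(\bar u) - F'(\bar u)(v_n - \bar u)\| + \|F'(\bar u)(v_n - \bar u)\| + \delta\big)^2 + \alpha \mathcal{R}(v_n) \\
&\le (\rho_m + c\lambda_n + \gamma_n + \delta)^2 + \alpha \mathcal{R}(v_n).
\end{aligned}$$

Denote

$$\beta_n := (\rho_m + c\lambda_n + \gamma_n + \delta)^2. \tag{19}$$

Using the notation (19) and the source condition (13), we get

$$\begin{aligned}
\|F_m(u^{\alpha,\delta}_{m,n}) - y^\delta\|^2 + \alpha D^{\xi}_{\mathcal{R}}(u^{\alpha,\delta}_{m,n}, \bar u)
&\le \beta_n + \alpha \mathcal{R}(v_n) - \alpha \mathcal{R}(\bar u) - \alpha \langle \xi, u^{\alpha,\delta}_{m,n} - \bar u \rangle \\
&= \beta_n + \alpha D^{\xi}_{\mathcal{R}}(v_n, \bar u) - \alpha \langle \xi, u^{\alpha,\delta}_{m,n} - v_n \rangle \\
&= \beta_n + \alpha \lambda_n - \alpha \langle \omega, F'(\bar u)(u^{\alpha,\delta}_{m,n} - v_n) \rangle \\
&= \beta_n + \alpha \lambda_n - \alpha \langle \omega, F'(\bar u)(u^{\alpha,\delta}_{m,n} - \bar u) \rangle + \alpha \langle \omega, F'(\bar u)(v_n - \bar u) \rangle \\
&\le \beta_n + \alpha \lambda_n + \alpha c \|\omega\| D^{\xi}_{\mathcal{R}}(u^{\alpha,\delta}_{m,n}, \bar u) + \alpha \|\omega\| \|F(u^{\alpha,\delta}_{m,n}) - F(\bar u)\| + \alpha \|\omega\| \gamma_n.
\end{aligned}$$

Therefore,

$$\|F_m(u^{\alpha,\delta}_{m,n}) - y^\delta\|^2 + \alpha (1 - c\|\omega\|) D^{\xi}_{\mathcal{R}}(u^{\alpha,\delta}_{m,n}, \bar u) \le \beta_n + \alpha \lambda_n + \alpha \|\omega\| (\zeta_n + \gamma_n), \tag{20}$$

where $\zeta_n = \|F_m(u^{\alpha,\delta}_{m,n}) - y\|$. Due to (15), the term $\alpha(1 - c\|\omega\|) D^{\xi}_{\mathcal{R}}(u^{\alpha,\delta}_{m,n}, \bar u)$ is non-negative. Therefore,

$$\|F_m(u^{\alpha,\delta}_{m,n}) - y^\delta\|^2 \le \beta_n + \alpha \lambda_n + \alpha \|\omega\| (\zeta_n + \gamma_n). \tag{21}$$

Using (3) we have

$$\zeta_n^2 \le \big(\|F_m(u^{\alpha,\delta}_{m,n}) - y^\delta\| + \|y^\delta - y\|\big)^2 \le 2\|F_m(u^{\alpha,\delta}_{m,n}) - y^\delta\|^2 + 2\delta^2.$$

This together with inequality (21) implies

$$\zeta_n^2 \le 2\beta_n + 2\alpha\lambda_n + 2\alpha\|\omega\|\zeta_n + 2\alpha\|\omega\|\gamma_n + 2\delta^2,$$

which yields

$$\zeta_n \le 2\alpha\|\omega\| + \big(2\alpha^2\|\omega\|^2 + 2\delta^2 + 2\beta_n + 2\alpha\lambda_n + 2\alpha\|\omega\|\gamma_n\big)^{1/2}. \tag{22}$$

From (20) it follows that

$$\alpha(1 - c\|\omega\|) D^{\xi}_{\mathcal{R}}(u^{\alpha,\delta}_{m,n}, \bar u) \le \beta_n + \alpha\lambda_n + \alpha\|\omega\|\zeta_n + \alpha\|\omega\|\gamma_n,$$

with $\zeta_n$ estimated above. Using (19) and taking $\alpha \sim \max\{\delta, \lambda_n, \gamma_n\}$ yield the above convergence rate. □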

3 The case of separable Banach spaces 

Let $X$ be a separable Banach space. Consider a nested sequence of finite dimensional subspaces $X_n$, $n \in \mathbb{N}$, such that

$$X = \overline{\bigcup_{n \in \mathbb{N}} X_n},$$

where the closure is considered with respect to the norm topology of $X$. By letting $Z = X$, taking a weak topology $\tau$ on $X$ and a regularization functional $\mathcal{R} : X \to [0, +\infty]$, and using the assumptions employed for the results in Section 2, one obtains stability and convergence results similar to Proposition 2.3 and Theorem 2.4. Moreover, if $\mathcal{R}$ is convex, then convergence rates can also be established.

Remark 3.1. In some situations, convergence of a sequence $\{u_k\}$ to $u$ with respect to the topology $\tau$ and such that $\mathcal{R}(u_k) \to \mathcal{R}(u)$ implies $\|u_k - u\| \to 0$ as $k \to \infty$. This is the case of locally uniformly convex reflexive Banach spaces when $\tau$ is chosen as the weak topology on $X$ and $\mathcal{R} = \|\cdot\|^p$, with $p \in (1, +\infty)$, such as Hilbert spaces, $L^p$ spaces, $W^{m,p}$ spaces, but also in $L^1$ when $\tau$ is the weak topology and $\mathcal{R}$ is the Shannon entropy (see f^).



Sparsity regularization 

Let $\{\phi_i\}$ be an orthonormal basis of $L^2(\Omega)$. Denote by $X$ the Banach space $\ell^2$, which is identified with the functions with square-summable ($\ell^2$) Fourier coefficients. Let $X_n$ be the linear span of the first $n$ Fourier modes.

For sparsity regularization one usually takes $\mathcal{R}(u) = \sum_i |u_i|$, where $u = \sum_i u_i \phi_i$. Thus, we consider the regularization method of minimizing the functional

$$u \mapsto \|F(u) - y^\delta\|^2 + \alpha \sum_i |u_i|.$$

In this case, the topology $\tau$ on $X$ is the weak topology of $\ell^2$. The regularization results apply also in this setting.
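For intuition, the discretized sparsity functional can be minimized explicitly in a simplified situation. The sketch below rests on assumptions that are not made in the text: $F$ is a linear operator that is diagonal in the basis $\{\phi_i\}$ with nonzero entries `s`, so the problem decouples and each coefficient is obtained by soft thresholding; restricting to $X_n$ means that only the first $n$ coefficients may be nonzero.

```python
def sparse_minimizer(s, y_delta, alpha, n):
    """Minimize sum_i (s[i]*u_i - y_delta[i])**2 + alpha * sum_i |u_i|
    over coefficient vectors supported on the first n modes (the subspace X_n).

    Per coordinate, the optimality condition 2*s*(s*u - y) + alpha*sign(u) = 0
    leads to soft thresholding of z = s*y at level alpha/2, divided by s**2.
    """
    u = [0.0] * len(s)
    for i in range(min(n, len(s))):
        z = s[i] * y_delta[i]
        shrunk = max(abs(z) - alpha / 2.0, 0.0)
        u[i] = (1.0 if z > 0 else -1.0) * shrunk / s[i] ** 2
    return u

if __name__ == "__main__":
    s = [1.0, 1.0, 1.0, 1.0]
    y_delta = [3.0, 0.2, -2.0, 0.1]
    # Small data coefficients are set exactly to zero, which is the sparsity effect.
    print(sparse_minimizer(s, y_delta, alpha=1.0, n=4))
    print(sparse_minimizer(s, y_delta, alpha=1.0, n=2))
```

Coefficients whose data lie below the threshold are set exactly to zero, which is the mechanism by which the $\ell^1$ penalty promotes sparse regularized solutions.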

Another example of a regularization term which promotes sparsity is

$$\mathcal{R}(u) = \sum_i |u_i|^p, \quad p \in (0,1). \tag{23}$$

This setting with $X = \ell^2$ and $\tau$ taken as the weak topology of $\ell^2$ is also covered by the regularization theory analyzed in this work. Moreover, convergence of a sequence $\{u_k\}$ to $u$ with respect to the topology $\tau$ and such that $\mathcal{R}(u_k) \to \mathcal{R}(u)$ implies convergence of $\{u_k\}$ to $u$ relative to the quasinorm (23) as $k \to \infty$, cf. HZ].



4 Particular nonseparable spaces 

4.1 The bounded variation functions space 

Recall that, for a bounded Lipschitz domain $\Omega \subset \mathbb{R}^N$ and for a given $N \in \mathbb{N}$, the space $BV(\Omega)$ of $L^1(\Omega)$-functions of bounded variation mapping into $\mathbb{R}$ can be defined as the set of functions $w \in L^1(\Omega)$ such that the total variation of $w$ is finite, that is,

$$\int_\Omega |Dw|_p = \sup \Big\{ \int_\Omega w(x)\, \nabla \cdot \psi(x)\, dx : \psi \in C_0^1(\Omega; \mathbb{R}^N),\ |\psi(x)|_{p'} \le 1 \text{ for all } x \in \Omega \Big\} < \infty.$$

Here, $|\cdot|_{p'}$ denotes the $\ell^{p'}$ vector norm, where $p' = p/(p-1)$ is the conjugate exponent to $p$. In particular we are interested in the cases $p = 1, 2$, where $\ell^{p'} = \ell^\infty, \ell^2$, respectively. The case $p = 2$ corresponds to isotropic total variation.
Let us recall several properties of the space $BV(\Omega)$:

• It is the dual of a separable Banach space (see [3, Remark 3.12]) when provided with the norm

$$\|w\|_{BV} = \|w\|_{L^1(\Omega)} + \int_\Omega |Dw|_p.$$

• The space $BV(\Omega)$ is continuously embedded in $L^r(\Omega)$, where $1 \le r \le N/(N-1)$.

We consider the setting $X = BV(\Omega)$, with $\tau$ being the weak* topology on $BV(\Omega)$, and $Z = L^1(\Omega)$.

The functional $\mathcal{R}$ is the total variation seminorm. Consider

$$d(u,v) = \|u - v\|_{L^1(\Omega)} + \Big| \int_\Omega |Du| - \int_\Omega |Dv| \Big|.$$

The metric $d$ is the metric used also in [5], which gives the so-called strict convergence, according to [3]. A similar idea is developed in [3TJ[52]. Moreover, the strict convergence of a sequence $\{u_k\}$ to $u$ is equivalent to convergence with respect to the topology $\tau$ together with $\int_\Omega |Du_k|_1 \to \int_\Omega |Du|_1$ as $k \to \infty$, since weak* convergence of a sequence $\{u_k\}$ to $u$ in $BV(\Omega)$ is equivalent to boundedness of $\{\|u_k\|_{BV}\}$ together with convergence of $\{u_k\}$ to $u$ in $L^1(\Omega)$; see, e.g., Proposition 3.13 in [3].

The choice of the vector norm in the definition of the bounded variation seminorm is of special importance for approximating the BV space by subspaces consisting of piecewise constant functions. Assume that $\{\Omega_i\}$ is a decomposition of $\Omega$, and consider the following finite dimensional spaces:

$$X_n = \Big\{ \sum_{i=1}^{n} u_i \chi_{\Omega_i} : u_i \in \mathbb{R} \Big\}.$$

When considering a partition of $\Omega$ into uniform parallelepipeds, we can only guarantee the density assumption (5) when using the $\ell^1$-norm in the definition of BV, see [5]. If one wants an isotropic behavior of the regularization term, one has to use the $\ell^2$-norm in the definition of the BV-seminorm. The problem is that in the case of uniform parallelepipeds (for instance pixels in imaging), there is no convergence with respect to this isotropic BV-seminorm. One has to consider a more general partition of the domain $\Omega$ that allows level lines with any direction to be approximated. One idea is to use an irregular triangulation. These observations have been made in [S] and [5]; here we only state their main result, concerning anisotropic and isotropic total variation:

Theorem 4.1 ([HI IS]). Let $\Omega \subset \mathbb{R}^N$ be a polygonal domain and let $h_0 > 0$.

• Given $u \in BV(\Omega)$, there exists a family $\{\mathcal{T}_h : 0 < h < h_0\}$ of triangulations of $\Omega$ such that the mesh-size of $\mathcal{T}_h$ is at most $h$, and functions $u_h \in A^0_h$, where $A^0_h$ denotes the space of piecewise constant functions corresponding to the triangulation $\mathcal{T}_h$, such that

$$\|u - u_h\|_{L^1} + \Big| \int_\Omega |Du|_2 - \int_\Omega |Du_h|_2 \Big| \to 0 \quad \text{as } h \to 0.$$

• Let $u \in BV(\Omega)$ be given and let $\{\Omega_i\}$ be a decomposition of $\Omega$ into parallelepipeds such that the maximal length of a parallelepiped is smaller than $h$. Then there exist functions $u_h \in V^0_h$, where $V^0_h$ denotes the space of piecewise constant functions corresponding to the partition $\{\Omega_i\}$, such that

$$\|u - u_h\|_{L^1} + \Big| \int_\Omega |Du|_1 - \int_\Omega |Du_h|_1 \Big| \to 0 \quad \text{as } h \to 0.$$
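The failure of pixel grids for the isotropic seminorm can be checked on a simple example. The sketch below makes choices that are not part of the cited results: the function is the indicator of the set $\{x < y\}$ in the unit square, and it is sampled at the pixel centers of a uniform $n \times n$ grid. For a function that is constant on pixels, all jumps occur across axis-parallel edges, so its isotropic and anisotropic total variation coincide and equal the sum of jump height times edge length.

```python
import math

def pixel_total_variation(n):
    """Total variation of the indicator of {x < y}, sampled at the pixel
    centers of a uniform n x n grid on the unit square."""
    h = 1.0 / n
    # Pixel (i, j) has center ((i + 0.5) * h, (j + 0.5) * h), so x < y iff i < j.
    value = [[1.0 if i < j else 0.0 for j in range(n)] for i in range(n)]
    tv = 0.0
    for i in range(n):
        for j in range(n):
            if i + 1 < n:  # jump across the edge shared with the neighbor in x-direction
                tv += h * abs(value[i + 1][j] - value[i][j])
            if j + 1 < n:  # jump across the edge shared with the neighbor in y-direction
                tv += h * abs(value[i][j + 1] - value[i][j])
    return tv

if __name__ == "__main__":
    for n in (10, 100, 400):
        print(n, pixel_total_variation(n))
    print("isotropic TV of the limit (length of the diagonal):", math.sqrt(2.0))
    print("anisotropic TV of the limit:", 2.0)
```

The staircase boundary has total length $2(n-1)/n$, which tends to 2. This is the anisotropic ($p = 1$) total variation of the limit function, whereas its isotropic ($p = 2$) total variation is the length $\sqrt{2}$ of the diagonal. In this example the pixel approximations therefore converge in the anisotropic seminorm but not in the isotropic one.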

Another type of approximation of $X$ is the '$\Phi_\alpha$-approximation property' employed in [15] which, adapted to our notation, reads as follows:

$X_n$ is a $\Phi_\alpha$-approximation of $X$ if, for each $u \in X$, there exists a sequence $\{u_n\} \subset X_n$ such that $\|u - u_n\|_Z \to 0$ and $\Phi_\alpha(u_n) \to \Phi_\alpha(u)$ as $n \to \infty$, for any $\alpha > 0$, where $\Phi_\alpha$ is given by

$$\Phi_\alpha(u) = \|F(u) - y^\delta\|^2 + \alpha \mathcal{R}(u).$$

In [15] one aims at approximating minimizers of $\Phi_\alpha$ (which depends on $\alpha$) in $X$ by minimizers of $\Phi_\alpha$ in $X_n$ for a fixed $\alpha > 0$, while our aim is to approximate solutions of the operator equation by minimizers of $\Phi_\alpha$ in $X_n$ when the regularization parameter $\alpha$ depends on the dimension $n$.
We summarize the results of this section in an example.



Example 4.2. Let $X = BV(\Omega)$ with the weak* topology $\tau$. Moreover, let $Z = L^1(\Omega)$ and let $d$ be the metric

$$d(u,v) = \|u - v\|_{L^1(\Omega)} + \Big| \int_\Omega |Du| - \int_\Omega |Dv| \Big|. \tag{24}$$

Moreover, let $X_n$ be spaces of piecewise constant functions as in Theorem 4.1.

Let $F_m$ and $F : \mathcal{D}(F) \subset BV(\Omega) \to L^2(\Omega)$ satisfy the related conditions in Assumption 2.2.

Then according to Theorem 4.1, for every $u \in BV(\Omega)$ there exists an approximating sequence of piecewise constant functions. Consequently, minimization of the discretized regularized problem is well-posed, stable, and convergent (cf. Proposition 2.3). The piecewise constant regularizers $\{u^{\alpha,\delta}_{m,n}\}$ approximate the $\mathcal{R}$-minimizing solution $\bar u$ in the sense of the metric (24).



4.2 The bounded deformation functions space 

In the following let $\Omega = (0,1)^N$ be the open unit cube. We choose this simple geometry so as not to be forced to take into account approximations of $\Omega$, or irregular meshes, for the finite element method considered below.

The space $BD(\Omega)$ [33] of functions with bounded deformation in an open set $\Omega \subset \mathbb{R}^N$ is defined as the set of functions $u = (u^1, \dots, u^N) \in L^1(\Omega; \mathbb{R}^N)$ such that the symmetric distributional derivative

$$E_{ij} u := \frac{1}{2}\big(D_i u^j + D_j u^i\big)$$

is a (matrix-valued) measure with finite total variation in $\Omega$:

$$BD(\Omega) := \{ u \in L^1(\Omega; \mathbb{R}^N) : E_{ij}(u) \in M_1(\Omega),\ i,j = 1, \dots, N \},$$

where $M_1(\Omega)$ denotes the space of bounded measures. $BD(\Omega)$ is a nonseparable Banach space provided with the norm

$$\|u\|_{BD} = \|u\|_{L^1(\Omega;\mathbb{R}^N)} + \sum_{i,j=1}^{N} \int_\Omega |E_{ij}(u)|.$$

This space is strictly larger than the space of bounded variation functions $BV(\Omega; \mathbb{R}^N)$. It was introduced in [3T] and [23J and has been widely considered in the literature (see [33] [33J) in connection with the mathematical theory of plasticity. Several interesting properties of the BD space are as follows: It is the dual of a separable Banach space (see [33]); the space $BD(\Omega)$ is continuously embedded in $L^p(\Omega; \mathbb{R}^N)$, where $1 \le p \le N/(N-1)$. In addition, the space $BD(\Omega; \mathbb{R}^N)$ is not separable: if it were, the space $BV(\Omega; \mathbb{R}^N)$, which is the subspace of $BD(\Omega; \mathbb{R}^N)$ where all components $u^j$, $j = 2, \dots, N$, vanish, would be as well. However, this is not true, as stated already in the previous example.

Let us return to the finite dimensional regularization framework we investigate in this work.

Consider the setting $X = BD(\Omega)$, $\tau$ the weak* topology on $BD(\Omega)$ and $Z = L^1(\Omega; \mathbb{R}^N)$. We associate with $n \in \mathbb{N}$ the discretization size $h_n := 1/n$, use multi-indices $\alpha := (\alpha_1, \dots, \alpha_N)$ and set $A := \{0, \dots, n\}^N$.

We consider the finite-dimensional product space of piecewise linear splines

$$X_n := \Big\{ \sum_{\alpha \in A} \sum_{k=1}^{N} u^k_\alpha\, \Lambda\Big(\frac{x - \xi_\alpha}{h_n}\Big) e_k : u^k_\alpha \in \mathbb{R} \Big\},$$

where $\xi_\alpha \in h_n \{0, 1, \dots, n\}^N$, and $\Lambda$ is the following function

$$\Lambda(x) := \prod_{i=1}^{N} \max(0, 1 - |x_i|).$$

This finite element discretization has already been used in [10] for numerical minimization of variational energies. For $x \in (0, h_n)^N$ the derivative of $\Lambda((x - \xi_\alpha)/h_n)$ in direction $e_l$ is given by

D > A (Hr) - *<* - ^ n (- (»• 1 - ) • 

For every a with a; < n and for every k,l < k < N define A k a := {/3 G A a ,/3 k = a 
Additionally define il a as the iV— dimensional cube, spanned by the vectors 
+ e kh n } k=1 ,„ N , and i a = ljf =1 A^, (see Figured]). 
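
Figure 1: An example for the sets $A_\alpha$ (case $N = 2$, with grid points $\xi_\alpha$, $\xi_{\alpha+(1,0)}$, $\xi_{\alpha+(0,1)}$, $\xi_{\alpha+(1,1)}$): $A_\alpha = \{\alpha, \alpha + (0,1), \alpha + (1,0), \alpha + (1,1)\}$, $A^1_\alpha = \{\alpha, \alpha + (0,1)\}$, $A^2_\alpha = \{\alpha, \alpha + (1,0)\}$.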



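Moreover, for $\beta \in A_\alpha$ and $x \in \Omega_\alpha$ we have

$$D_l \Lambda\Big(\frac{x - \xi_\beta}{h_n}\Big) = \begin{cases} -\dfrac{1}{h_n} \displaystyle\prod_{i \neq l} \Big(1 - \dfrac{|x_i - \xi_{\beta,i}|}{h_n}\Big) & \text{if } \beta_l = \alpha_l, \\[3mm] +\dfrac{1}{h_n} \displaystyle\prod_{i \neq l} \Big(1 - \dfrac{|x_i - \xi_{\beta,i}|}{h_n}\Big) & \text{if } \beta_l = \alpha_l + 1. \end{cases} \tag{25}$$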



In the following we prove the main result on a pseudometric. 



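Theorem 4.3. We assume that $h_n \to 0$ when $n \to \infty$. Then for every $u \in BD(\Omega; \mathbb{R}^N) \cap L^r(\Omega; \mathbb{R}^N)$, $1 \le r < \infty$, we can find a sequence $\{u_n\}$, with $u_n \in X_n$, such that

$$\lim_{n \to \infty} \int_\Omega |u - u_n|^r\, dx = 0 \quad \text{and} \quad \lim_{n \to \infty} \int_\Omega |E u_n| = \int_\Omega |E u|.$$

Setting

$$d(u_1, u_2) = \|u_1 - u_2\|_{L^1} + \Big| \int_\Omega |E u_1| - \int_\Omega |E u_2| \Big|, \tag{26}$$

we obtain $\lim_{n \to \infty} d(u_n, u) = 0$.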



In order to prove this theorem, we need some additional facts on BD, given 
by the following Lemmas. 



Lemma 4.4. For every $n \in \mathbb{N}$ the inclusion $X_n \subset BD(\Omega; \mathbb{R}^N)$ holds and for each $u = (u^1, \dots, u^N) \in X_n$,



N 



a k=l 



E 



JV-1 



N 1 

EE^ 



E V u /3+e, ".^ ' V",d + C 



Proof. From the definition of $u \in X_n$ we obtain

/in 



AT-1 

Moreover, since $\Omega = \bigcup_{\alpha_i < n} \Omega_\alpha$, we can split up the integral. When integrating $D_l u^k + D_k u^l$ over $\Omega_\alpha$, we only have to sum over those $\beta$ for which $\Lambda((\cdot - \xi_\beta)/h_n)$ does not vanish identically on $\Omega_\alpha$. Hence we only sum over $\beta \in A_\alpha$ in the inner sum.
I ' \D lU k + D k u l \=J2 I \Diu k + D k u l \ 



= E 



E I^aa 



+ u l pD k A 



h n 



Next we use (25) and obtain



^2 



iV-l 



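The lemma follows from summing over all $\alpha$ and all $l, k = 1, \dots, N$. □

Lemma 4.5. 1. If $\{u_n\} \subset BD(\Omega)$ and $u_n \to u$ in $(L^1(\Omega))^N$, then

$$\int_\Omega |E_{ij} u| \le \liminf_{n \to \infty} \int_\Omega |E_{ij} u_n|.$$

2. For every $u \in BD(\Omega) \cap L^r(\Omega)$, $1 \le r < \infty$, there exists a sequence $\{u_n\} \subset C^\infty(\bar\Omega)$ such that

$$\lim_{n \to \infty} \int_\Omega |u - u_n|^r\, dx = 0, \qquad \lim_{n \to \infty} \int_\Omega |E u_n| = \int_\Omega |E u|.$$

Proof. 1. Follows from standard properties of convex measures.
2. See [31, Théorème 3.2, Chapitre II]. □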



Now we are ready for the proof of Theorem 4.3.

Proof of Theorem 4.3. The $L^r$-convergence of the piecewise polynomial functions $u_n$ can be found in the book of Ciarlet [12]. Due to Lemma 4.5 we can assume that $u \in C^\infty(\bar\Omega)$. Set the coefficients of $u_n$ as $(u_n)^k_\alpha := u^k(\xi_\alpha)$, then we have



JV 
k—l ol 

From Lemma 14.41 it follows that 



x-£ c 



\E k iU n \ 



\D lU k +D k u l n 



E (^(W -"%))+ E WW-^)) 

f3eA' a peA* 



N-l 



2 
(27) 

Next we use the mean value theorem: For $m, n \in \{l, k\}$, $m \neq n$, we can find points $\eta_{\beta,m,n}$ between $\xi_\beta$ and $\xi_{\beta + e_m}$ such that

$$u^k(\xi_{\beta + e_l}) - u^k(\xi_\beta) = h_n D_l u^k(\eta_{\beta,k,l}),$$

hence from (27) it follows that



N 



N-l 



Summing over all $\alpha$ and $l, k = 1, \dots, N$ and taking the limit $h_n \to 0$ we have that

$$\lim_{n \to \infty} \int_\Omega |E_{ij} u_n| = \int_\Omega |E_{ij} u|.$$

□



We summarize the results of this section as follows: 

Example 4.6. Let $X = BD(\Omega)$ and $\tau$ the weak* topology. Moreover, let $Z = L^1(\Omega; \mathbb{R}^N)$ and let $d$ be the metric

$$d(u,v) = \|u - v\|_{L^1} + \Big| \int_\Omega |E u| - \int_\Omega |E v| \Big|. \tag{28}$$

Let $F_m$ and $F : \mathcal{D}(F) \subset BD(\Omega) \to L^2(\Omega)$ satisfy the related conditions in Assumption 2.2.

Take $h = 1/n$. Then according to Theorem 4.3, for every $u \in BD(\Omega; \mathbb{R}^N) \cap L^r(\Omega; \mathbb{R}^N)$, $1 \le r < \infty$, there exists an approximating sequence of piecewise linear functions in the sense of the metric $d$. Consequently, minimization of the discretized regularized problem is well-posed, stable, and convergent according to Proposition 2.3. The piecewise linear regularizers $u^{\alpha,\delta}_{m,n}$ approximate the $\mathcal{R}$-minimizing solution $\bar u$ in the sense that $\lim u^{\alpha,\delta}_{m,n} = \bar u$ in the weak* topology and

$$\lim \int_\Omega |E u^{\alpha,\delta}_{m,n}| = \int_\Omega |E \bar u|.$$



4.3 The $L^\infty$ space

In this section we analyze the following regularization method

$$u \mapsto \|F(u) - y^\delta\|^2 + \alpha \|u\|_\infty.$$

In the sequel we show that there exist finite dimensional subspaces $\{X_n\}$ of $L^\infty(\Omega)$ satisfying equality (5). More precisely, there exist finite dimensional subspaces $\{X_n\}$ of $L^\infty(\Omega)$ such that for any $u \in L^\infty(\Omega)$ one can find $u_n \in X_n$, $n \in \mathbb{N}$, with

$$\lim_{n \to \infty} \big( \|u_n - u\|_{L^p} + \big| \|u_n\|_\infty - \|u\|_\infty \big| \big) = 0, \quad p \in [1, +\infty). \tag{29}$$

It is known that $L^\infty(\Omega)$ is not separable, while $L^p(\Omega)$, for every $p \in [1, +\infty)$, is separable. However, every function in $L^\infty(\Omega)$ can be approximated uniformly, and thus in the $L^\infty(\Omega)$-norm, by a sequence of simple functions. The proof of this statement is constructive. However, it provides a nonlinear approximation, in the sense that the piecewise constant functions which approximate the $L^\infty(\Omega)$ function do not yield linear subspaces; see, e.g., Q3J Section 3.2]. This does not fit the theoretical framework we consider here. An alternative is to consider approximations of $L^\infty(\Omega)$ functions by piecewise constant functions in a weaker topology, as shown in the sequel.

For the sake of simplicity let $\Omega = (0,1)^N$. Assume that $\{\Omega_j\}$ is a decomposition of $\Omega$ in parallelepipeds with maximal diagonal length $h_n$ as in [5], and consider the following finite dimensional subspaces of $L^\infty(\Omega)$:

$$X_n = \Big\{ u_n = \sum_{j=1}^{n} u^j \chi_{\Omega_j} : u^j \in \mathbb{R},\ 1 \le j \le n \Big\}.$$



The following results are essential in proving the main statement of this 
section: 

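First denote by $J_\varepsilon * u$ the mollification of $u$, for every $\varepsilon > 0$.

Theorem 4.7. [p. 36 J^J Let $u$ be a function which is defined on $\mathbb{R}^N$ and vanishes identically outside $\Omega$.

If $u \in L^p(\Omega)$, then $J_\varepsilon * u \in C^\infty(\mathbb{R}^N)$, in fact $J_\varepsilon * u \in C^\infty(\bar\Omega)$ and $J_\varepsilon * u \in L^p(\Omega)$, for every $p \in [1, +\infty)$. Also,

$$\lim_{\varepsilon \to 0+} \|J_\varepsilon * u - u\|_p = 0, \tag{30}$$

$$\|J_\varepsilon * u\|_p \le \|u\|_p \quad \text{for all } \varepsilon > 0. \tag{31}$$

Lemma 4.8. For every $n \in \mathbb{N}$, one has $X_n \subset L^\infty(\Omega)$ and, for each $u_n \in X_n$,

$$\|u_n\|_\infty = \max_{1 \le j \le n} |u^j|.$$

Proof. Follows immediately from the definition of $u_n$, taking into account that $|\Omega_j| > 0$ for $1 \le j \le n$. □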

Lemma 4.9. If $\{u_n\} \subset L^\infty(\Omega)$ and $u \in L^\infty(\Omega)$ are such that $\|u - u_n\|_{L^p} \to 0$ as $n \to \infty$, for some $p \in [1, +\infty)$, and $\|u_n\|_\infty \le \|u\|_\infty$ for every $n \in \mathbb{N}$, then $\{u_n\}$ converges weakly* to $u$ and $\|u_n\|_\infty \to \|u\|_\infty$ as $n \to \infty$.

Proof. Since $\{\|u_n\|_\infty\}$ is bounded, there exists a subsequence $\{u_k\}$ which converges weakly* to some $v \in L^\infty(\Omega)$, cf. the Alaoglu-Bourbaki Theorem, [19, p. 70]. By the definition of weak* convergence, one obtains that $\{u_k\}$ also converges weakly, with respect to $L^p(\Omega)$, to $v$. Therefore $u = v$. In fact, every subsequence of $\{u_n\}$ has a further subsequence which converges weakly* to $u$, which yields weak* convergence of the entire sequence $\{u_n\}$ to $u$. In addition, the weak* lower semicontinuity of the $L^\infty$-norm implies $\|u\|_\infty \le \liminf_{n \to \infty} \|u_n\|_\infty$. Together with $\|u_n\|_\infty \le \|u\|_\infty$, this gives $\|u_n\|_\infty \to \|u\|_\infty$.

Thus the assertions are proved. □

Note that the previous result can be relaxed by assuming only weak convergence of $\{u_n\}$ in $L^p(\Omega)$.

Lemma 4.10. For every $u \in C^\infty(\overline{\Omega})$, there exists a sequence $\{u_n\}$ with $u_n \in X_n$ such that

$$\lim_{n \to \infty} \big( \|u_n - u\|_{L^p} + \big| \|u_n\|_\infty - \|u\|_\infty \big| \big) = 0, \qquad p \in [1, +\infty).$$

Proof. Let $\xi_j$ be the gravity centers of the parallelepipeds $\Omega_j$. Define

$$u_n = \sum_{j=1}^{n} u(\xi_j) \chi_{\Omega_j}.$$

Then $\|u_n - u\|_{L^p} \to 0$ - see, for instance, the book of Ciarlet [12]. Moreover, by using Lemma 4.8,

$$\|u_n\|_\infty = \max_{1 \le j \le n} |u(\xi_j)| \le \max_{x \in \overline{\Omega}} |u(x)| = \|u\|_\infty, \quad \text{for all } n \in \mathbb{N},$$

where the last equality holds due to the continuity of $u$ on $\overline{\Omega}$. Thus, Lemma 4.9 applies and yields $\big| \|u_n\|_\infty - \|u\|_\infty \big| \to 0$ as $n \to \infty$. □
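
The construction in the proof of Lemma 4.10 can be checked numerically in one space dimension. The following is a minimal sketch, assuming $\Omega = (0,1)$, a uniform decomposition into $n$ cells, and the smooth test function $u(x) = \sin(\pi x)$; the function names and the quadrature used to estimate the $L^p$ error are illustrative choices and not part of the original analysis.

```python
import numpy as np

def midpoint_approximation(u, n):
    """Return the cell edges of a uniform decomposition of (0, 1) into n cells
    and the coefficients u(xi_j) taken at the cell centers xi_j."""
    edges = np.linspace(0.0, 1.0, n + 1)
    centers = 0.5 * (edges[:-1] + edges[1:])
    return edges, u(centers)

def lp_error(u, n, p=2, samples_per_cell=200):
    """Estimate ||u_n - u||_{L^p(0,1)} with a midpoint rule inside every cell."""
    edges, values = midpoint_approximation(u, n)
    h = 1.0 / n
    offsets = (np.arange(samples_per_cell) + 0.5) / samples_per_cell
    nodes = edges[:-1, None] + h * offsets[None, :]      # shape (n, samples)
    integrand = np.abs(values[:, None] - u(nodes)) ** p
    return (integrand.mean(axis=1).sum() * h) ** (1.0 / p)

# Assumed test function with ||u||_inf = 1 on [0, 1].
u = lambda x: np.sin(np.pi * x)

for n in (10, 100, 1000):
    _, coeffs = midpoint_approximation(u, n)
    sup_norm = np.max(np.abs(coeffs))    # equals ||u_n||_inf by Lemma 4.8
    print(n, lp_error(u, n), sup_norm)
```

In this sketch the estimated $L^2$ error shrinks as the mesh is refined, while the maximum of the coefficients never exceeds $\|u\|_\infty = 1$ and approaches it, which is the behavior the lemma asserts.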

Theorem 4.11. Assume that $h_n \to 0$ when $n \to \infty$. Then for every $u \in L^\infty(\Omega)$ one can find $u_n \in X_n$ such that (29) holds.

Proof. Let $u \in L^\infty(\Omega)$ and $p \in [1, +\infty)$. By Theorem 4.7 there exists $\{u_j\} \subset C^\infty(\mathbb{R}^N)$, in fact in $C^\infty(\overline{\Omega})$, with $\{u_j\} \subset L^p(\Omega)$ such that

$$\lim_{j \to \infty} \|u_j - u\|_p = 0, \quad \text{and} \quad \|u_j\|_p \le \|u\|_p, \quad \text{for all } j \in \mathbb{N}.$$

By letting $p \to +\infty$ in the last inequality and using Theorem 2.8 on p. 25 in PP, one also has

$$\|u_j\|_\infty \le \|u\|_\infty, \quad \text{for all } j \in \mathbb{N}.$$

Lemma 4.9 applies and yields $\|u_j\|_\infty \to \|u\|_\infty$ as $j \to \infty$.

By consequence, every function in $L^\infty(\Omega)$ can be approximated in the sense stated at (29) by functions from $C^\infty(\overline{\Omega})$. Since every function $v \in C^\infty(\overline{\Omega})$ can be approximated by $u_n \in X_n$ as in (29) due to Lemma 4.10, the conclusion follows immediately. □

Remark 4.12. One can define the subspaces $X_n$ in the previous theorem also by means of piecewise polynomial functions of degree no bigger than one in each variable, which are continuous on $\Omega$. Moreover, one can employ n-simplices instead of parallelepipeds and piecewise linear functions, according to Remark 3.8 in 0/.

We summarize the results of this section as follows: 

Example 4.13. Let $X = L^\infty(\Omega)$ and $\tau$ the weak* topology. Moreover, let $Z = L^p(\Omega)$, $p \in (1, +\infty)$, and let $d$ be the metric

$$d(u, v) = \|u - v\|_{L^p(\Omega)} + \big| \|u\|_\infty - \|v\|_\infty \big|. \qquad (32)$$

Let $F_m$ and $F : \mathcal{D}(F) \subseteq L^\infty(\Omega) \to L^2(\Omega)$ satisfy the related conditions in Assumption 2.2.

Take $h = 1/n$. Then according to Theorem 4.11, for every $u \in L^\infty(\Omega)$ there exists an approximating sequence of piecewise constant functions in the sense of the metric $d$. Consequently, minimization of the discretized regularized problem is well-posed, stable, and convergent according to Proposition 2.3. The piecewise constant regularizers $\{u_{m,n}^{\alpha,\delta}\}$ approximate the $\mathcal{R}$-minimizing solution $u$ in the sense that $\lim u_{m,n}^{\alpha,\delta} = u$ in the weak star topology and $\lim \|u_{m,n}^{\alpha,\delta}\|_\infty = \|u\|_\infty$.



The convergence described in the previous sentence is weaker than convergence with respect to the metric (32). The fact that these two types of convergence are not equivalent, in contrast to the BV and BD cases with the two corresponding convergence types, is shown by the following counterexample.

Consider the Rademacher functions $f_n : [0, 1] \to \{-1, 1\}$,

$$f_n(x) = (-1)^{i+1} \quad \text{if } x \in \Big[ \frac{i-1}{2^n}, \frac{i}{2^n} \Big), \quad 1 \le i \le 2^n.$$



This sequence converges weakly star to zero in $L^\infty([0, 1])$, but not in the $L^1([0, 1])$ norm. Moreover, consider $g_n : [0, 2] \to \mathbb{R}$,

$$g_n(t) = f_n(t), \quad \text{if } t \in [0, 1],$$

and $g_n(t) = 1$ for $t \in (1, 2]$. Then $\{g_n\}$ converges weakly star to $\chi_{[1,2]}$, the characteristic function of $[1, 2]$, in $L^\infty([0, 2])$ but not in the $L^1([0, 2])$ norm. Note that $\|g_n\|_\infty = \|\chi_{[1,2]}\|_\infty = 1$. Thus, $\{g_n\}$ is a sequence in $L^\infty([0, 2])$ which converges weakly star to $\chi_{[1,2]}$ and such that $\lim_{n \to \infty} \|g_n\|_\infty = \|\chi_{[1,2]}\|_\infty$, but

$$\lim_{n \to \infty} \big( \|g_n - \chi_{[1,2]}\|_{L^1} + \big| \|g_n\|_\infty - \|\chi_{[1,2]}\|_\infty \big| \big) \ne 0.$$
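
The counterexample can be illustrated numerically. The sketch below is only an illustration under stated assumptions: the integrals are replaced by a midpoint rule on a dyadic grid of $[0, 2]$, weak star convergence is probed with a single test function $\varphi(t) = e^t$ (an arbitrary choice), and all function names are hypothetical.

```python
import numpy as np

def rademacher(n, x):
    """f_n(x) = (-1)^(i+1) on [(i-1)/2^n, i/2^n), i = 1, ..., 2^n."""
    i = np.minimum(np.floor(x * 2**n).astype(int), 2**n - 1) + 1
    return np.where(i % 2 == 1, 1.0, -1.0)

def g(n, t):
    """g_n = f_n on [0, 1] and g_n = 1 on (1, 2]."""
    t = np.asarray(t, dtype=float)
    return np.where(t <= 1.0, rademacher(n, np.clip(t, 0.0, 1.0)), 1.0)

# Midpoint grid on [0, 2]; it resolves the dyadic cells of f_n for n <= 15.
M = 2**16
dt = 2.0 / M
t = (np.arange(M) + 0.5) * dt
chi = (t > 1.0).astype(float)     # characteristic function of [1, 2]
phi = np.exp(t)                   # assumed test function in L^1(0, 2)

def pairing_gap(n):
    """|int (g_n - chi) * phi dt|, which should tend to zero (weak star)."""
    return abs(np.sum((g(n, t) - chi) * phi) * dt)

def l1_distance(n):
    """||g_n - chi||_{L^1(0,2)}, which stays away from zero."""
    return np.sum(np.abs(g(n, t) - chi)) * dt

for n in (2, 6, 10):
    print(n, pairing_gap(n), l1_distance(n))
```

Since $|g_n - \chi_{[1,2]}| = |f_n| = 1$ on $[0, 1]$ and vanishes on $(1, 2]$, the $L^1$ distance equals 1 for every $n$, whereas the pairing with the test function decays as $n$ grows.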

Remark 4.14. Given a direct problem formulated in $L^\infty$, is it worthwhile to formulate the inverse problem in $L^\infty$, or is it more appropriate to formulate the problem in $L^2$, where one can also obtain $L^2$-approximations? $L^2$-approximations are quite advantageous because in the $L^\infty$ case, one might not even get convergence with respect to the $L^1$-norm, as demonstrated by the above counterexample. However, it is sometimes desirable that the regularized solutions are guaranteed to belong also to the $L^\infty$ space. Moreover, the extremal behavior of the solution can be evaluated by $L^\infty$ regularization, since convergence of the $L^\infty$-norm of the regularized solutions to the $L^\infty$-norm of the true solution is achieved. Also, in some
inverse problems high interest is given to estimating a linear functional of the 
solution rather than the solution - according to the mollifier idea in J4$, which 
corresponds to the weak-star approximation of our $L^\infty$ regularization results.

5 The inverse ground water filtration problem



We consider the problem of recovering the diffusion coefficient in

$$-(a u_x)_x = f \quad \text{in } \Omega,$$
$$u(0) = 0 = u(1),$$

with $f \in L^2[0, 1]$. The operator $F$ is defined as the parameter-to-solution mapping

$$F : \mathcal{D}(F) := \{ a \in L^\infty[0, 1] : a(x) \ge c > 0 \text{ a.e.} \} \to L^2[0, 1],$$
$$a \mapsto F(a) := u(a),$$

where $u(a)$ is the unique solution of the above equation and $c$ is a constant. More details about this ill-posed problem can be found in [2U] and [TT| Chapter 1]. Note that $\mathcal{D}(F)$ is a subset of the interior of the nonnegative cone of $L^\infty[0, 1]$. In fact $L^\infty$ is the natural function space when formulating this problem. However, to the best of our knowledge, previous literature dealing with the problem from the regularization viewpoint has usually considered $\mathcal{D}(F)$ in $H^1$, which is embedded in $L^\infty$, mainly due to the Hilbert space setting which is enforced by using $H^1$. The operator $F$ is Fréchet differentiable from $L^\infty$ to $L^2$ - see [50]. Note that $\mathcal{D}(F)$ is closed and convex with respect to $L^2$, so it is weakly closed in $L^2$. This implies that $\mathcal{D}(F)$ is weakly* closed in $L^\infty$. Similarly one can argue that the operator $F$ is sequentially weakly*-weakly closed.

In the sequel we are going to employ approximation operators $F_m$ as in [25]. Let $Y_m$ be the space of linear splines on a uniform grid of $m + 1$ points in $[0, 1]$, which vanish at 0 and 1. By using the variational formulation, let $u_m(a) \in Y_m$ be the unique solution of

$$\langle a (u_m)_x, v_x \rangle_{L^2} = \langle f, v \rangle_{L^2} \quad \text{for all } v \in Y_m.$$

The operators $F_m$ are defined as

$$F_m : \mathcal{D}(F) := \{ a \in L^\infty[0, 1] : a(x) \ge c > 0 \text{ a.e.} \} \to L^2[0, 1],$$
$$a \mapsto F_m(a) := u_m(a).$$

Then © holds for $\rho_m = m^{-2}$, cf. [12, Theorems 3.2.2, 3.2.5]:

$$\|F_m(a) - F(a)\|_{L^2} = \|u_m(a) - u(a)\|_{L^2} = O(\|a\|_{L^\infty} \cdot m^{-2}).$$
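
The operator $F_m$ and the quadratic rate can be explored with a small finite element computation. The following is a minimal sketch and not the implementation of [25]: it assumes a coefficient $a(x) = 1 + x$ and a manufactured solution $u(x) = \sin(\pi x)$ chosen only for testing, evaluates the integrals with a three-point Gauss rule, and uses hypothetical function names.

```python
import numpy as np

def gauss_on_unit_interval(n_quad=3):
    """Gauss-Legendre points and weights mapped from [-1, 1] to [0, 1]."""
    gp, gw = np.polynomial.legendre.leggauss(n_quad)
    return 0.5 * (gp + 1.0), 0.5 * gw

def solve_fem(a, f, m):
    """Linear-spline Galerkin solution u_m(a) of -(a u_x)_x = f with
    u(0) = u(1) = 0 on a uniform grid of m + 1 points."""
    h = 1.0 / m
    nodes = np.linspace(0.0, 1.0, m + 1)
    s, w = gauss_on_unit_interval()
    x = nodes[:-1, None] + h * s[None, :]          # quadrature points per cell
    a_bar = (a(x) * w).sum(axis=1)                 # cell averages of a
    fx = f(x)
    load_left = h * (fx * (1.0 - s) * w).sum(axis=1)   # f against left hat
    load_right = h * (fx * s * w).sum(axis=1)          # f against right hat
    K = np.zeros((m - 1, m - 1))                   # stiffness matrix
    idx = np.arange(m - 1)
    K[idx, idx] = (a_bar[:-1] + a_bar[1:]) / h
    K[idx[:-1], idx[:-1] + 1] = -a_bar[1:-1] / h
    K[idx[:-1] + 1, idx[:-1]] = -a_bar[1:-1] / h
    b = load_right[:-1] + load_left[1:]
    u_h = np.zeros(m + 1)
    u_h[1:-1] = np.linalg.solve(K, b)
    return nodes, u_h

def l2_error(u_exact, nodes, u_h):
    """L^2 distance between the spline with nodal values u_h and u_exact."""
    h = nodes[1] - nodes[0]
    s, w = gauss_on_unit_interval()
    x = nodes[:-1, None] + h * s[None, :]
    spline = u_h[:-1, None] * (1.0 - s) + u_h[1:, None] * s
    return np.sqrt(h * (((spline - u_exact(x)) ** 2) * w).sum())

# Assumed test data: a(x) = 1 + x and u(x) = sin(pi x), so that
# f = -(a u_x)_x = (1 + x) pi^2 sin(pi x) - pi cos(pi x).
a_test = lambda x: 1.0 + x
u_exact = lambda x: np.sin(np.pi * x)
f_test = lambda x: (1.0 + x) * np.pi**2 * np.sin(np.pi * x) - np.pi * np.cos(np.pi * x)

def error_for(m):
    nodes, u_h = solve_fem(a_test, f_test, m)
    return l2_error(u_exact, nodes, u_h)

for m in (8, 16, 32, 64):
    print(m, error_for(m))
```

Doubling $m$ in this sketch should reduce the printed $L^2$ error by roughly a factor of four, in line with the rate $m^{-2}$ stated above.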

Then by choosing the discretization of $L^\infty[0, 1]$ as in Section 4, one obtains the convergence results of the discretized regularization method as in Example